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Field of the Invention 



oligourea containing the basic-arginine rich region of 
Tat, the method of production of this oligourea and the 

0 use thereof. In particular, this invention relates to 
the design of drugs comprising the oligourea backbone of 
the invention, further comprising amino acid side chains. 
Similarly, the DNA-binding oligourea of the invention can 
also be synthesized to control biological processes 

5 involving DNA-protein interactions 

Background of the Invention 



referred to in brackets and footnotes throughout the 
0 specification. These articles are incorporated by 
reference herein to describe the state of the art to 
which this invention pertains. Full citations of the 
references appear at the end of the specification. 



5 in many cellular functions such as transcription, RNA 

splicing, and translation. Small peptides with unnatural 
backbones that can bind with high affinity to a specific 
sequence or structure of nucleic acids and interfere with 
protein-nucleic acid interactions would provide useful 

0 tools in molecular biology and medicine. Recently, 

minor-groove-binding polyamide ligands have been designed 
for sequence-specific recognition of DNA. 1 In contrast to 
DNA, RNA molecules can fold into extensive structures 



This invention relates to a synthesized 



Various scientific and scholarly articles are 



Protein-nucleic acid interactions are involved 
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containing regions of double-stranded duplex, hairpins, 
internal loops, bulged bases and pseudo-knotted 
structures, 2 The complexity of RNA structure makes it 
difficult to design ligands for sequence-specific RNA- 
5 recognition. Three-dimensional structures of RNA create 
binding sites for specific interactions with proteins. 

One example of such interactions is the 
mechanism of trans-activation of human immunodeficiency 
virus type 1 (HIV-1) gene expression that requires the 

10 interaction of Tat protein with the trans-activation 
responsive region (TAR) RNA, a 59-base stem-loop 
structure located at the 5 '-end of all nascent HIV-1 
transcripts. 3 Replication of human immunodeficiency virus 
type 1 (HIV- 1) requires specific interactions of Tat 

15 protein with the TAR RNA. Inhibition of Tat-TAR 
interactions is a potential approach for anti-HIV 
therapeutics. Since structural information is now 
available for TAR RNA and TAR-Tat peptide complexes from 
NMR 4 , photocrosslinking, 5 and affinity cleaving studies, 6 

20 it is possible to design small molecules to interfere 

with Tat-TAR function. We have recently begun to examine 
TAR RNA recognition by unnatural biopolymers . 7 

Objects of* the Invention 

25 It is an object of the invention to provide 

substances which have higher binding affinities for RNA 
than natural peptides, which are resistant to proteases 
and which can interact with nucleic acids in a fashion 
similar to natural peptides. Such substances can be used 

30 to inhibit protein-nucleic acid interactions important 
for cellular processes. 

It is a further object of the invention to 
provide a substance inhibiting protein-nucleic acid 
interactions. In particular, it is an object of the 

35 invention to provide a substance which controls 
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biological processes involving DNA-protein interactions, 
and which inhibit transcription in HIV-1 infected cells. 
Such a substance leads to the design of drugs based on 
the substance . 
5 The current invention comprises a novel 

synthesized oligourea containing the basic-arginine rich 
region of Tat. The oligourea of the invention shows 
specific recognition of HIV-1 TAR RNA. 

Other ob j ects and advantages of the invention 
10 will become apparent to those skilled in the art from the 
accompanying description of the invention. 

Brief Description of the Drawings 

Figure 1A: The Tat-derived peptide, amino acids 

15 48 to 57, contains the RNA-binding domain of Tat protein 
(SEQ ID NO:l). Figure IB: Structure of the generic 
oligourea backbone. Side-chains corresponding to a 
desired amino acid are substituted at the R x and R 2 
positions. Sequence of Tat-derived oligourea corresponds 

20 to the side-chains of the Tat peptide shown in (A) , 
except the addition of an L-Tyr amino acid at the 
carboxyl-terminus . Tat-derived oligourea was synthesized 
on solid support by using activated p-nitrophenyl 
carbamates and azides of protected amines followed by 

25 reduction with SnCl 2 -thiophenol-triethylamine (Kim, J. M.; 
Bi, Y. Z.; Paikoff, S. J.; Schultz, P. G. Tetrahedron 
Lett. 1996, 37, 5305- 5308; Kick, B.; Bllman, J. J Med. 
Chem. 1995, 38, 1427-1430; incorporated by reference 
herein) . After cleavage from the resin, the oligourea 

30 was purified by HPLC on a Zorbax 300 SB-C 8 column (Wang, 
Z.; Rana, T. M. J Am. Chem. Soc. 1995, 117, 5438-5444; 
Wang, Z.; Rana, T. M. Biochemistry 1996, 35, 6491-6499; 
Wang, Z.; Wang, x.; Rana, T. M. J Biol. Chem. 1996, 27, 
16995-16998; incorporated by reference herein) . The mass 
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of fully deprotected and purified oligourea was confirmed 
by ES and MALDI mass spectrometry; 1849,2 (M +H) . 

Figure 2A: Secondary structure of wild-type TAR 
5 RNA used in this study. Wild-type TAR RNA spans the 
minimal sequences that are required for Tat 
responsiveness in vivo 14 and for in vitro binding of Tat- 
derived peptides. 9 Wild-type TAR contains two non-wild- 
0 type base pairs to increase transcription by T7 RNA 

5|;'f 10 polymerase. Mutant MO TAR contained no bulge residue in 
fp;!i its sequence. In mutant G2 6C, a base-pair in the upper 

% stem of TAR RNA, G26-C39 was substituted by C26-G39. 

i!W Figure 2B: Electrophoretic mobility shift analysis for 

[Pi I 

^ the Tat-derived oligourea binding to wild-type and 

Q 15 trinucleotide bulge mutant (MO) TAR RNA. 5 '-end labeled 

ju TAR RNAs (40 nM) were heated to 85 C for three minutes 



mm and then cooled to room temperature in TK buffer (50 
ijlj mM Tris-HCI pH 7.4), 20 mM KCI, 0.1% Triton X-100) . The 

oligourea (150 nM) was added to wild-type or mutant TAR 
20 and incubated at room temperature for one hour. After 
adding 30% glycerol, the oligourea-RNA complexes were 
resolved on a non-denaturing 12% acrylamide gel and 
visualized by autoradiography or phosphorimaging. Figure 
2C: Specificity of the oligourea-TAR complex formation 
25 determined by competition experiments. Oligourea-RNA 
complexes were formed in the presence of increasing 
concentrations of unlabeled wild-type or mutant TAR RNAs. 
Concentrations of the competitor RNAs in lanes 3, 4, 5, 6 
were 50, 100, 150, and 200 nM, respectively. Lanes 1 and 
30 2 were marker lanes showing RNA and oligoureaRNA 

complexes. Oligourea-RNA complexes are labeled as R-P. 



35 



Figure 3. Site-specific photocrosslinking 
reaction of TAR RNA labeled with 4-thioUracil at position 
23 with the oligourea. For photochemical reactions, RNA 
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duplex was prepared by hybridizing two strands. 5 ' 7 Strand 
1 of the duplex was 5' -end labeled with 32p Preformed RNA 
duplexes (40 nM) in the absence or presence of the 
oligourea (100 riM) were irradiated (360 urn) and analyzed 
5 by denaturing gels as described earlier. 5,7 Proteinase K 
digestion was performed at 55 °C for fifteen minutes 
after LW irradiation. R-R and R-P XL indicate the RNA- 
RNA and RNA-oligourea crosslink, respectively. 

10 Figure 4. Inhibition of Tat transactivation by 

the oligourea derivative in vivo. CAT activity expressed 
from the integrated HIV-1 LTR of HL3TI Cells with 
increasing amounts of oligourea is shown. Luciferase 
activity was a control experiment to monitor the 

15 transfection inhibition of pSV2Tat by the addition of 

oligourea. CAT and Luciferase activities were measured 
from multiple experiments and normalized to 100%. 
Control lane (labeled as positive) shows Tat 
transactivation in the absence of oligourea. 

20 

Detailed Description of the Invention 

The invention provides a composition and method 
for inhibiting the interaction of a nucleic acid and 
specific binding protein in vitro and in vivo. The 

25 composition is an oligourea backbone as disclosed in 

figure IB, with amino acid side-chains substituted at the 
Ri and R 2 positions. In accordance with the invention, it 
has been discovered that the rigid and protease 
insensitive oligourea backbone, when substituted with a 

30 sequence of amino acid side-chains modeled after a known 
nucleic acid binding domain, will mimic the nucleic acid 
binding domain in specificity, but with a much lower 
disassociation constant. This nucleic acid binding 
composition may be used for research into the 

35 physiological effects of nucleic acid binding proteins, 
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assay methods for detecting nucleic acids and therapeutic 
methods for inhibiting protein-nucleic acid interactions 
that lead to disease states. Also provided, is a method 
for inhibiting protein-nucleic acid interactions in vitro 

5 and in vivo which entails introducing the oligourea 
molecules of the invention. 

The composition of the invention is composed 
oligourea backbone, the generic form of which is 
disclosed in Fig, IB, which supports the side chains of 

0 amino acids, at the R 2 and R 2 position of Fig. 2B. When 
the oligourea molecule has amino acid side chains that 
correspond to the side chains of a nucleic acid binding 
protein in composition and sequence, the oligourea 
molecule then binds to the target nucleic acid 

5 specifically and with a very low disassociation constant. 
In a preferred embodiment, the oligourea molecule has a 
dissociation constant upon binding the target nucleic 
acid of less than or equal to 0.7 yM (less than or equal 
to 0.5 more preferred; less than or equal to 0.3 yM 

0 most preferred) . In Example 1, the use of an oligourea 
molecule of the invention is illustrated which mimics the 
RNA-binding protein Tat. In a preferred embodiment, the 
oligourea molecule is comprised of amino acid side chains 
that mimic the Tat molecule. In a more preferred 

5 embodiment, the side-chains correspond to residues 48 - 
57 of the Tat molecule, more preferred, SEQ ID NO:l. In 
a most preferred embodiment, the amino acid side-chains 
correspond to SEQ ID NO:l with a L-Tyr amino acid at the 
carboxyl-terminus . 

0 The composition of the invention encompasses a 

very diverse assortment of molecules, all with oligourea 
backbones and amino acid side-chains. The oligourea 
molecule may be any length which achieves the desired 
dissociation constant from the nucleic acid. In a 

5 preferred embodiment, the oligourea is 3 to 50 urea-units 
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long (5 to 30 more preferred, 8 to 25 most preferred) . 
The oligourea molecule may comprise amino acid side- 
chains that correspond to the binding region of any 
nucleic acid binding protein presently known or that will 

5 be discovered. Types of DNA binding proteins of interest 
include, but are not limited to, transcription control 
proteins (e.g. transcription factors, see Conaway and 
Conaway, 1994, Transcription Mechanisms and Regulation , 
Raven Press Series on Molecular and Cellular Biology, 

0 Vol. 3, Raven Press, Ltd., New York, NY), recombination 
enzymes (e.g. hin recombinase) , DNA modifying enzymes 
(e.g. restriction enzymes), structural proteins (e.g. 
histones and nonhistone chromatin proteins such as HMG 
proteins), single-stranded DNA-binding proteins (e.g. 

5 those involved in the propagation of a DNA replication 

fork or in the packaging of T-DNA ssDNA) and double- and 
single-stranded RNA-binding proteins. RNA-binding 
proteins are also contemplated in regard to the present 
invention, (see, for example, Draper DE, J Mol Biol 1999 

0 Oct 22;293 (2) : 255-70; Haile DJ, Am J Med Sci 1999 

Oct; 318 (4 ): 230-40; Cusack S, Curr Opin Struct Biol 1999 
Feb; 9(1): 66-73) . 

Transcription factors suitable for use with the 
present invention include, but are not limited to, 

5 homeobox proteins, zinc finger proteins, hormone 

receptors, helix- turn-helix proteins , helix- loop-helix 
proteins, basic-Zip proteins (bZip) and (3-ribbon factors 
(see Harrison, 1991, Nature 353:715-719). Homeobox 
DNA-binding proteins contemplated for use with the 

0 instant invention include, but are not limited to, HOX, 
STF-1 (Leonard et al., 1993, Mol. Endo . , 7:1275-1283; 
Scott et al. (1989), Biochem. Biophys . Acta, 989:25-48), 
Antp, Mat oc-2 and INV. * Zinc finger DNA-binding proteins 
contemplated for use with the instant invention include, 

5 but are not limited to, Zif268, GLI and XFin. For 
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reviews of zinc-finger DNA-binciing proteins see Klug and 
Rhodes (1987), Trends Biochem. Sci . , 12:464; Jacobs and 
Michaels (1990), New Biol., 2:583; and Jacobs (1992), 
EMBO J., 11:4507-4517. Hormone receptor DNA-binding 
5 proteins contemplated for use with the instant invention 
include, but are not limited to, glucocorticoid receptor, 
thyroid hormone receptor and estrogen receptor (see, 
e.g., U.S. Pat. Nos . 4,981,784; 5,171,671; and 
5,071,773). Helix-turn-helix DNA-binding proteins 

10 contemplated for use with the instant invention include, 
but are not limited to, A-repressor, cro-repressor , 434 
repressor and 434-cro (See, e.g., Pabo and Sauer, 1984, 
Annu. Rev. Biochem., 53:293-321). Helix-loop-helix 
DNA-binding proteins contemplated for use with the 

15 instant invention include, but are not limited to, MRF4 
(Block et al., 1992, Mol . and Cell Biol., 
12 (6) :2484-2492) , CTF4 (Tsay et al . , 1992, NAR, 
20 (10) :2624) , NSCL, PAL 2 andUSF (see, for review, Wright 
(1992), Current Opinion in Genetics and Development, 

20 2 (2) :243-248; Kadesch, T. (1992), Immun. Today, 

13(1): 31-36; and Garell and Campuzano (1991), Bioessays, 
13 (10) : 493-498) . Basic Zip DNA-binding proteins 
contemplated for use with the instant invention include, 
but are not limited to, GCN4, fos and jun (see, for 

25 review, Lamb and McKnight, 1991, Trends Biochem. Sci., 

16:417-422). (3-ribbon factors contemplated for use with 
the instant invention include, but are not limited to, 
Met- J, ARC, and MNT . 



30 a diverse range of uses. Any application that requires a 
strong nucleic acid binding molecule may use the 
oligourea molecules of the invention. The oligourea 
molecules may be used to inhibit the native nucleic acid 
binding molecule by competing with the native protein 

35 molecule for the binding site on the nucleic acid. This 



The oligourea composition of the invention has 
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application may be used for research purposes or for 
therapy purposes. In therapeutic methods, the oligourea 
molecules of the invention may be used to inhibit a 
protein-nucleic interaction that leads to a disease 
5 state. Example 1 illustrates the use of an oligourea 
molecule to inhibit the interaction between the Tat 
protein and the TAR RNA from HIV. Finally, the oligourea 
molecule may be used to detect the presence of the target 
nucleic acid molecule in any method that requires the 
10 detection and/or quantization of a specific nucleic acid. 



specific interaction between a binding protein and its 
target nucleic acid comprising introducing an oligourea 
molecule that specifically competes with the binding 

15 protein for the binding site on the target nucleic acid. 
In a preferred embodiment, the method is a therapeutic 
method for patients in need of such a treatment. This 
method is particularly suited as therapeutic method 
because of the high specificity of the inhibition 

20 provided. The therapeutic method is applicable to any 

disease state in which a nucleic acid-protein interaction 
affects the disease state. In a preferred embodiment, 
the patient is human. In a more preferred the patient is 
infected by the HIV-1 virus, and the oligourea molecule 

25 introduced comprises amino acid side chains that 
correspond to the Tat molecule. 



procedures involved in practicing the present invention. 
To the extent that specific materials are mentioned, it 
30 is merely for purposes of illustration and is not 
intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 
forth in Sambrook et al . , Molecular Cloning , Cold Spring 
Harbor Laboratory (1989) (hereinafter "Sambrook et al." ) 



A method to inhibit the interaction between a 



The following Example sets forth the general 
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Biology , John Wiley & Sons (2000) (hereinafter *Ausubel 
et al.") are used. 

The following example is provided to describe 
the invention in greater detail. It is intended to 
5 illustrate, not to limit, the invention. 

Example I 

We synthesized an oligourea containing the 

C! 10 basic-arginine rich region of Tat by solid phase 

!p!J 

jjjjj synthesis methods, and tested for TAR RNA -binding. This 

yl' tat-derived unnatural biopolymer binds TAR RNA 

hi] 

jirj specifically with affinities higher than the wild-type 

^ Tat peptide. Site-specific photocrosslinking experiments 

g 15 using a photoactive analog ( 4- thio-uracil ) containing TAR 

P. !I 

RNA revealed that the unnatural biopolymer interacts with 
RNA in the major groove. The oligourea-RNA complexes 
were stable to proteolytic digestion. RNA recognition by 
an oligourea provides a new class of RNA-binding 
20 molecules that can be used to control cellular processes 
involving RNA-protein interactions in vivo. 

In this report, we synthesized an oligourea 
containing the basic-arginine rich region of Tat by solid 
phase synthesis methods, and tested for TAR RNA binding. 
25 Oligoureas have backbones with hydrogen bonding groups, 
chiral centers, and a significant degree of 
conformational restriction. Introducing additional side 
chains at the backbone NH sites can further modify 
biological and physical properties of these oligomers. 
30 This tat-derived unnatural biopolymer binds specifically 
to TAR RNA with affinities higher than the wild-type Tat 
peptide. These results identify a new class of unnatural 
peptides for structure-specific recognition of RNA. 

The promoter of HIV-1, located in the U3 region 
35 of the viral long terminal repeat (LTR) , is an inducible 
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promoter which can be stimulated by the trans-activator 
protein, Tat. 3 As in other lentiviruses, Tat protein is 
essential for trans-activation of viral gene expression. 8 
A number of studies showed that Tat-derived peptides 

5 which contain the basic arginine -rich region of Tat are 
able to form in vitro complexes with TAR RNA. 5 We 
synthesized a tat-derived oligourea containing the basic- 
arginine rich region of Tat protein by solid phase 
synthesis methods (Figure 1) . Recently, two methods have 

0 been reported for solid phase synthesis of oligourea. 10, 11 
To synthesize Tat-derived oligourea on solid support, we 
used activated p-nitrophenyl carbamates and protected 
amines in the form of azides, which were reduced with 
SnCl 2 -thiophenol-triethylamine on solid support. 11, 12 

5 After HPLC purification and characterization by mass 
spectrometry, the oligourea was tested for TAR RNA 
binding (Figure 2) . The tat-derived oligourea was able 
to bind TAR RNA and failed to bind a mutant TAR RNA 
without the bulge residues. 

0 Equilibrium dissociation constants of the 

oligourea-TAR RNA complexes were measured using direct 
and competition electrophoretic mobility assays, 13 
Dissociation constants were calculated from multiple sets 
of experiments which showed that the oligourea binds TAR 

5 RNA with a of 0.11 ± 0.07 fM. To compare the RNA- 

binding affinities of the oligourea to natural peptide, 
we synthesized a tat-derived peptide (Tyr47 to Arg57) 
containing the RNA-binding domain of Tat protein (Figure 
1) . Dissociation constants of the Tat peptide-RNA 

0 complexes were determined from multiple sets of 

experiments under the same conditions used for oligourea- 
TAR RNA complexes. These experiments showed that the Tat 
peptide (47-57) binds TAR RNA with a of 0.78 + 0.05 juM. 
A relative dissociation constant (K^J can be determined 

5 by measuring the ratios of wild-type Tat peptide to the 
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oligourea dissociation constants (K D ) for TAR RNA. Our 
results demonstrate that the calculated value for K REL was 
7.09, indicating that the urea backbone structure 
enhanced the TAR binding affinities of the unnatural 
5 biopolymer . 

Specificity of the oligourea-TAR RNA complex 
formation was addressed by competition experiments 
(Figure 2c) . Oligourea-RNA complex formation was 
inhibited by the addition of unlabeled wild-type TAR RNA 

10 and not by mutant TAR RNAs . Mutant TAR RNA without a 
trinucleotide bulge (Figure 2c) or with one base bulge 
(data not shown) was not able to compete for oligourea 
binding to wild-type TAR RNA. 

Two base -pairs immediately above the pyrimidine 

15 bulge are critical for Tat recognition . 9 To determine 
whether the oligourea recognizes specific base-pairs in 
the stem region of TAR RNA or only a trinucleotide bulge 
containing RNA, we synthesized a TAR mutant where the 
G26-C39 base pair was substituted by a C26-G39 base-pair 

20 (Figure 2a) . Competition experiments showed that this 
mutant TAR (G2 6C) did not inhibit Oligourea binding to 
TAR RNA (Figure 2c) . These results indicate that the 
tat-derived oligourea can specifically recognize TAR RNA. 

To probe the oligourea-RNA interactions and 

25 determine the proteolysis stability of oligourea, we 

synthesized TAR RNA containing 4-thioU at position 23 and 
performed photocrosslinking experiments as described 
earlier (Figure 3) . 5 ' 7 Irradiation of the oligourea-RNA 
complex yields a new band with electrophoretic mobility 

30 less than that of the RNA (lane 4) . Both the oligourea 
and UV (360 nm) irradiation are required for the 
formation of this crosslinked RNA-oligourea complex (see 
lanes 3 and 4). Since the crosslinked oligourea-RNA 
complex is stable to alkaline pH (9.5), high temperature 

35 (85 °C) and denaturing conditions (8M urea, 2% SDS) , we 
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conclude that a covalent bond is formed between TAR RNA 
and the oligourea during the crosslinking reaction. 

To test the protease stability of the 
oligourea-RNA complexes, we subjected the oligourea-RNA 
5 crosslink products to very vigorous proteinase K 

digestion which showed that the complexes were completely 
stable and there were no signs of oligo urea degradation 
(lane 5 and 6). Under similar proteinase K treatment, 
Tat-TAR photocrosslink products resulted in a complete 

10 loss of RNA-protein crosslink and a gain in free RNA as 
observed by band intensities on the gel. 6 . 

These findings show that a small tat-derived 
oligourea binds TAR RNA specifically with high affinity 
and interacts in the major groove (4-thio groups at U23) 

15 of TAR RNA, Due to the difference in backbone structure, 
oligoureas may differ from peptides in hydrogen-bonding 
properties, lipophilicity, stability, and conformational 
flexibility. Moreover, oligoureas are resistant to 
proteinase K degradation. These characteristics of 

20 oligoureas may be useful in improving pharmacokinetic 

properties relative to peptides. RNA recognition by an 
oligourea provides a new approach for the design of drugs 
which will modulate RNA-protein interactions. 
Transfection enhancing agents could be utilized with 

25 drugs comprising the oligourea of the invention to 

ameliorate any problems associated with the transfection 
or uptake of the oligourea of the invention. 
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